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In this paper, we consider the problem of exploring structural regularities of networks by dividing 
the nodes of a network into groups such that the members of each group have similar patterns of 
connections to other groups. Specifically, we propose a general statistical model to describe network 
structure. In this model, group is viewed as hidden or unobserved quantity and it is learned by fitting 
the observed network data using the expectation-maximization algorithm. Compared with existing 
models, the most prominent strength of our model is the high flexibility. This strength enables it 
to possess the advantages of existing models and overcomes their shortcomings in a unified way. As 
a result, not only broad types of structure can be detected without prior knowledge of what type 
of intrinsic regularities exist in the network, but also the type of identified structure can be directly 
learned from data. Moreover, by differentiating outgoing edges from incoming edges, our model 
can detect several types of structural regularities beyond competing models. Tests on a number 
of real world and artificial networks demonstrate that our model outperforms the state-of-the-art 
model at shedding light on the structural features of networks, including the overlapping community 
structure, multipartite structure and several other types of structure which are beyond the capability 
of existing models. 

PACS numbers: 89.75.Fb, 05.10.-a 



I. INTRODUCTION 

Network provides a powerful tool for representing the 
structure of complex systems. These networks include 
social networks 0, 0, information networks @, 0], and 
biological networks [l], Much of recent research on 
networks actually aims to understand the structural reg- 
ularities and further to reveal the relationship between 
such structural regularities and the function of networks. 
For example, as a widely-studied structural characteristic 
of network, community structure is of high interest be- 
cause communities often correspond to functional units 
such as pathways for metabolic networks and collections 
of pages on a similar topic on the Web. 

Community structure is a kind of assortative structure, 
in which nodes are divided into groups such that the 
members within each group are mostly connected with 
each other. Contrary to community structure, multipar- 
tite structure is another important kind of structural reg- 
ularities observed in real world networks. Multipartite 
structure means that nodes of network can be divided 
into groups such that most of edges are across groups. 
Beside these salient structural characteristics, other types 
of structure are also observed in real world networks, such 
as hierarchical structure and core-periphery structure. 

However, existing methods mostly presume that cer- 
tain type of structure exists in the target network and 
then devote to detect such structure. This raises con- 
cerns to the reliability of the resulted structure. On one 
hand, the assumed structure may not match the intrin- 
sic structure of the target network and thus these meth- 
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ods are not applicable to these situations. On the other 
hand, several real world networks contain multiple types 
of structure simultaneously. Most existing methods are 
designed for certain type of structure and thus cannot de- 
tect the broad types of structure. In addition, several un- 
known types of structure may also exist in networks and 
a desired method should be able to detect such structure 
as well. Thus, it is the time to explore multiple types of 
structural regularities in networks. 

In the last decade, the identification of community 
structure has attracted much attention in various scien- 
tific fields. Many methods have been proposed and ap- 
plied successfully to some specific complex networks [g- 
[l6j . For review, the reader can refer to Ref. [l7|. These 
methods are from different perspectives, such as the cen- 
trality measures, modularity, link density, percolation 
theory, network compression, and spectral analysis. Re- 
cently, several generative models for network data are 
proposed to detect community structure [HI, [l9[ . These 
models view network structure as observed quantities and 
take communities as hidden groups of nodes. The com- 
munities are then identified by fitting the model to the 
observed network structure. For example, Ren et al. (2p| 
proposed a probabilistic model to uncover the overlap- 
ping community structure. This model assumes that the 
two end nodes of each edge are from the same community 
and this assumption is satisfied by the fuzzy membership 
of nodes. Zhang et al. [2l| applied the Latent Dirich- 
let Allocation (LDA, a well-known generative model) to 
social network analysis and gave a method to detect com- 
munity structure. The common drawback of these two 
models is that they can only uncover the community 
structure and fail to reveal other types of structural reg- 
ularities, e.g., multipartite structure. 

To characterize the hierarchical organization of net- 
works, Clauset et al. proposed the hierarchical random 
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graph model, which is capable of expressing both assor- 
tative and disassortative structure [23| • To explore more 
broad types of structure, Newman et al. proposed a 
mixture model for exploratory analysis of network struc- 
ture [221 . In this model, the nodes with similar connec- 
tion preference rather than the highly connected nodes 
are classified into the same group. In such a general 
way, this model can reveal several other kinds of struc- 
tural regularities beyond community structure. How- 
ever, this model fails to tell us which kind of structural 
regularities has been identified. More importantly, this 
model may produce a result which is a mixture of sev- 
eral types of structure, and thus the identified structure 
may not provide clear information about the structural 
regularities. The shortcoming of this model is attributed 
to that it only models the relationship between groups 
and nodes rather than the relationship among groups. 
Stochastic blockmodel provides an appropriate alterna- 
tive to the mixture model for exploring broad range of 
structural regularities. Karrer et al. utilized a degree- 
corrected stochastic blockmodel [24[ to investigate com- 
munity structure of network. Airoldi et al. gave a mixed 
membership stochastic blockmodel [25} to model net- 
work data. These work has demonstrated that stochastic 
blockmodel is a good choice for exploring regularities of 
network. However, the effectiveness of these models are 
limited by their inflexible model assumptions, e.g., the 
hard partition assumption or neglecting the directional- 
ity of edges. 

In this paper, we focus on exploring the intrinsic struc- 
tural regularities in network by dividing network nodes 
into groups such that the members of each group have 
similar patterns of connections to other groups. We pro- 
pose a general stochastic blockmodel (referred to as GSB 
model in this paper) to model the network structure. In 
this model, node groups are represented by unobserved or 
hidden quantities and the relationship among groups are 
explicitly modeled by a block matrix as the traditional 
blockmodels. Then, using the expectation-maximization 
algorithm, we fit the model to specific network data and 
thus detect intrinsic structural regularities of the network 
without prior knowledge of what type of regularity ex- 
ists in the network. Compared with existing models, the 
most prominent strength of our model is the high flexi- 
bility. This strength enables it to possess the advantages 
of existing models and overcomes their shortcomings in a 
unified way. As a result, not only broad types of structure 
can be detected, but also the type of identified structure 
can be indicated by the block matrix. In addition, our 
model can tell us the centrality of the node in each group 
and the mixed membership of nodes as well. 

Tests on a number of artificial and real world networks 
demonstrate that our model outperforms the state-of- 
the-art models at shedding light on the structural fea- 
tures of networks, including the overlapping community 
structure, multipartite structure and several other types 
of structure which is beyond the capability of existing 
models. 



II. THE MODEL 

Generally, a network with n nodes can be represented 
mathematically by an adjacency matrix A with elements 
Aij = I if there is an edge from node i to node j and 
otherwise. For weighted networks, is generalized to 
represent the weight of the edge from i to j. 

To investigate the structural regularities in network, 
we suppose that the n nodes of the network fall into c 
groups whose memberships are unknown, i.e., we cannot 
observe or measure them directly. In this paper, we pro- 
pose a statistical model to infer the group membership 
from the observed network structure. 

The model we used is a kind of stochastic blockmodel. 
Blockmodel is a generative model and has a long tradi- 
tion of study in the social science and computer science. 
For a standard blockmodel, a c x c matrix uj is gener- 
ally adopted such that the matrix element u rs denotes 
the probability that a randomly selected edge connects 
group r to group s, i.e., the tail node of the edge is from 
group r and the head node is from s. The advantage of 
blockmodel lies in that the matrix u> explicitly character- 
izes various types of connecting patterns among groups. 

In the standard blockmodel, the nodes in the same 
group are identical, i.e., each node in a group has equal 
probability to be the end node of an edge adjacent to the 
group. This constraint is relaxed in our model. Specifi- 
cally, for an edge with its tail node being from group r 
and its head node being from group s, we use 9 r i to de- 
note the probability that the tail node is i and 4> S j to de- 
note the probability that the head node is j respectively. 
In addition, we use g y and g y to denote respectively 
the group membership of the tail node and head node of 
the edge ey. 

Up to now, we have given all the quantities in our 
model. They can be classified into three classes: ob- 
served quantities {.Ay}, hidden quantities {~^y,^7"y}, 
and model parameters {w rs , 9 r i, 4> S j}- To simplify the no- 
tations, we henceforth denote by A the entire set {Ay} 
and similarly g, V, 6, (j> for {Ify}, {g y}, {w rs }, 
{d rl } and {<f> s j}. 

With our model, an edge ey is generated in the follow- 
ing process: 

1. Select two groups g~tj — r and gYj = s respectively 
for the tail node and head node of the edge with 
probability oj rs ; 

2. Draw the tail node i from the group r with proba- 
bility 9 ri ; 

3. Draw the head node j from the group s with prob- 
ability (f) a j. 

Summing over the latent quantities r and s, the proba- 
bility that we observe an edge eij can be written as 

Pr(ey \W, 6,4>)=y^ J Ursdri4>sj ■ (1) 
rs 
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Then, the likelihood of the observed network with re- 
spect to our model is 



PT(A\0J,9,<f>) = J] [J^WrA 



(2) 



Note that the self-loop edges are allowed and the weight 
Aij is taken as the number of multi-edges connecting 
node i to node j as done in many existing models in- 
cluding, for instance, the widely studied configuration 
model [261 ] . 

Intuitively, the parameter 9 r i characterizes the central- 
ity of node i in the group r from the perspective of out- 
going edges while <p S j describes the centrality of node j 
in the group s from the perspective of incoming edges. 
Differently from traditional blockmodels, by differentiat- 
ing these two kinds of centrality, our model can provide 
more flexibility to explore broad types of intrinsic struc- 
tural regularities in network. Note that the parameters 
w rs , 9 r i, <j) s j satisfy the normalization conditions 



= i, 



= 1. 



(3) 



r— 1 s— 1 



Now our task is to estimate the model parameters and 
to infer the unobserved quantities by fitting the model to 
the observed network data. The standard framework for 
such a task is likelihood maximization. Generally, one 
works not with the likelihood [ Eq. © ] itself but with its 
logarithm (log-likelihood) 



C = lnPr(A|w,i 



E^ ln 12 Ur ^ e 

ij \ rs / 



(4) 



The maximums of the likelihood and its logarithm are 
in the same place since the logarithm is a monotonically 
increasing function. 

Directly maximizing the log-likelihood is difficult be- 
cause of the inner sum over the unobserved quantities 
gij = r and g ij = s - Using Jensen's inequality, the 
maximization of the log-likelihood can be transformed to 
the maximization of the expected log-likelihood 

= E Pr (~^ 1 = = s\e tj ,uj,e,(j)) 



Ajj (lncj rs + \n0 r i + hup 



s J 



ijrs 



qijrsAij (lnw rs + hx9 ri + hi(f> s j), 



(5) 



where to simplify the notation we have defined qij rs = 
P r (yy = r i yij = s|eij, w, 0, 0), which denotes the prob- 
ability that one observes an edge with its tail node i 



from group r and its head node j from group s given the 
observed network and the model parameters. 

With the expected log-likelihood, we can give the best 
estimate of the value C and the position of its maximum 
represents the best estimate of the most likely values of 
the model parameters. Specifically, if the value of qij rs 
is known, we can find the values of the model param- 
eters lo, 9, <fi where C reaches its maximum. However, 
the calculation of gy rs requires the values of these model 
parameters. To address such a problem, an expectation- 
maximization (EM) algorithm is adopted. 

Under the framework of EM algorithm, we first calcu- 
late the value of qijrs by 



qijrs — 



Pr i~fij = r^jj = s,eij\u,0,i 

Pr(ey|w,M) 
uj rs Q r i(j) S j 



s ^rs^ri 



(6) 



Once we have the values of the qijrs, we can use them 
to evaluate the expected log-likelihood and hence to find 
the values of w, 9, <f> that maximize it. 

Introducing the Lagrange multipliers p, "f r and A s to 
incorporate the normalization conditions in Eq. ([3]), the 
expected log-likelihood expression to be maximized be- 
comes 



\ rs / r \ j 



By letting the derivative of C to be 0, the maximum 
of the expected log-likelihood occurs at the places where 



(8) 



Siirs Aij qij rs 

Sijs ^ijqijrs 
^2ir Ajj qijrs 
Eijr^mrs' 



Eqs. ((U and ([8} constitute our expectation- 
maximization algorithm. In the expectation step, 
the expected value of log-likelihood is calculated through 
evaluating the values of qijrs with Eq. ([6]). In the 
maximization step, the expected value of log-likelihood 
is maximized when the values of model parameters w, 
9, cf> are evaluated with Eq. Implementation of the 
algorithm consists merely of iterating Eqs. ([6]) and (|SJ) 
until convergence. 

When the algorithm converges, we obtain a set of val- 
ues for hidden quantity qij rs and model parameters to, 
9, </>. This set of values is self-consistent with respect 
to Eqs. © and ((5]). However, it is not always the place 
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where the log-likelihood reaches its maximum. In other 
words, the expectation- maximization algorithm may con- 
verge to local maxima of the log-likelihood. With differ- 
ent starting values, the algorithm will give rise to dif- 
ferent solutions. To obtain a satisfactory solution, it is 
necessary to perform several runs with different initial 
conditions and take the solution giving the highest log- 
likelihood over all the runs performed. 

By fitting the model to the observed network structure 
with the expectation-maximization algorithm, the esti- 
mated model parameters provide us vital information for 
structural regularities of the network. Specifically, 9 and 
4> describe the centrality of a node in groups containing it 
respectively from the perspective of outgoing edges and 
incoming edges. The parameter u characterizes the con- 
necting patterns among different groups, i.e., the type of 
structural regularities. 

More importantly, according to the model parameters, 
we can define two kinds of group memberships ai r and 
Pj a respectively from the perspective of outgoing edges 
and incoming edges. Specifically, ai r is the probability 
that node i is from group r when it acts as the tail node 
of edges while /3j s is the probability that node j is from 
group s when it acts as the head node of edges. For oti r , 
it can be calculated by 



Node: l...n [ 1 n 1 



(9) 



Actually, Oj r provides a soft or fuzzy membership, i.e., 
node % can belong to more than one groups simultane- 
ously. When the identified structural regularity corre- 
sponds to community structure, we actually obtain the 
overlapping community structure which has attracted 
much research attention ever since it is proposed. If one 
wants to get a hard partition, we can simply assign each 
node i to the group r satisfying r = arg max s {a.i S ,s = 
I, 2, ■ • • , c}. These statements for a r i also apply to f3i r 
defined as 



'rs'rsj 



J2rs ^rsfisj 



(10) 



Finally, the model described above so far is based on 
directed networks. Actually, the model can be easily gen- 
eralized to undirected networks by letting the parameter 
9 be identical to 4>. The derivation follows the case of di- 
rected networks and the results are the same to Eqs. © 
and ||SJ}. 

Now we discuss the computational cost of the 
expectation-maximization algorithm for the fitting of our 
model. For each iteration in this algorithm, the cost con- 
sists in two parts. The first part is from the calculation of 
qijrs using Eq. whose time-complexity is 0(m x c 2 ). 
Here m is the edges in the network and c is the number 
of groups. The second part is from the estimation of the 
model parameters using Eq. ([5]), whose time-complexity 
is also 0(mxc 2 ). We use T to denote the number of iter- 
ations before the iteration process converges. Then, the 
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FIG. 1: Generative models for network data: (a) Newman's 
mixture model [52], (b) Probabilistic model proposed in [2p| ]. 
and (c) our model. Filled circles represent observed quanti- 
ties and unfilled ones correspond to hidden quantities. The 
solid line (with arrow) between node i and j indicates the ex- 
istence of one (directed) edge connecting them. The dashed- 
line connecting two circles indicates that the relation between 
the corresponding quantities is unobserved and requires be- 
ing learned from the observed network data. Arrows represent 
the directions of relation. 

total cost of the expectation-maximization algorithm for 
our model is O (T x m x c 2 ) . It is difficult to give a theoret- 
ical estimation to the number T of iterations. Generally 
speaking, T is determined by the network structure and 
the initial condition. A well-designed initial condition 
may result in good convergence rate. The computational 
cost limits our model to deal with networks with mod- 
erate scale networks. We look forward to seeing more 
efficient implementation for our model. Note that the 
method proposed in [27| provides a promising way to 
improve the computational efficiency and decrease the 
memory space required. 



III. COMPARISON WITH OTHER MODELS 

In this section, we illustrate the difference and connec- 
tions between our model with several existing models. 
Fig. [T] gives the schematic for our model and two existing 
generative models, namely Newman's mixture model and 
Ren's model. 

For Newman's model, as shown in Fig.[Tfa), each group 
r is characterized by the connecting preference 8 r j to 
node j, no matter the node j is contained by the group r 
or not. The nodes belonging to the same group have sim- 
ilar connecting preference. As a result, both assortative 
and disassortative structural regularities can be detected 
by this model. However, this model has no parameter 
to explicitly characterize the type of the identified struc- 
ture. More importantly, this model may produce a result 
which is a mixture of several types of structure and thus 
in these cases the identified structure may provide con- 
fused information about the structural regularities. For 
example, for the network shown in Fig. [3J nodes 12, 15, 
16, 19, 21, 23 are identified by this model as overlapped 
nodes shared by the two groups, denoted by circles and 
squares, although these nodes only have connections to 
one of the two groups. 

For Ren's model, as shown in Fig. [TJb), the two end 
nodes of each edge is assumed to be from the same 
group. As a result, only the assortative structure (com- 
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FIG. 2: The network of the karate club studied by 
Zachary [2||. The real social fission of this network is repre- 
sented by two different shapes, circle and square. The shades 
of nodes indicate the mixed membership obtained by fitting 
our model to this network. The sizes of the nodes indicate 
the centrality degree of nodes with respect to the left group, 
i.e., 6 r i ranging from for the smallest nodes to 0.22 for the 
largest. 

munity) can be detected using this model. Note that, 
for this model, no edge is allowed to connect different 
groups. The relationship between communities is instead 
reflected by the overlapped nodes. 

For our model, it essentially is a kind of stochastic 
blockmodel, in which the relationships among different 
node groups are explicitly modeled by the block matrix 
w. In this way, our model possesses the advantages of 
both Newman's model and Ren's model and overcomes 
the shortcoming of these two models. 

On one hand, through learning the matrix w accord- 
ing to observed network data, various types of struc- 
tural regularities can be explored by our model. The 
type of the identified structure is indicated by the ma- 
trix w. Specifically, when the matrix uj is an identity 
matrix, the identified structural regularity corresponds 
to an obvious community structure. Meanwhile, multi- 
partite or anti-community structure is revealed when the 
estimated model parameter uj is an anti-diagonal matrix 
with all the anti-diagonal elements being 1. For other 
types of structure such as core-periphery structure and 
hierarchical structure, the form of w is the same to the 
block matrix uj in traditional block models [24j . 

On the other hand, using the matrix w, our model dis- 
cards the assumption of Ren's model that two end nodes 
of one edge are required to be from the same community. 
In this sense, Ren's model is a special case of our model. 
In addition, our model also provides several other flexi- 
bility. By representing the centrality of nodes in group 
from two different perspective respectively according to 
the outgoing edges and incoming edges, our model can 
detect more broad range of structural regularities which 
is out of the capability of other models. This will be 
shown later in the subsequent section. Moreover, our 
model can be further generalized by not requiring the 
matrix w be a square matrix. 

Finally, we compare our model to two recently pro- 



posed stochastic models for community detection [2J,|27|. 
Firstly, both our model and Karrer's model [24J are 
stochastic block model where a block matrix is adopted 
to characterize the connecting patterns among groups. 
The main difference between these two models lies in 
that Karrer's model is designed to detect disjoint struc- 
tural regularities while our model is for fuzzy structural 
regularities. This difference is reflected by the definition 
of the model parameters 9 and <f> in our model and the 
definition of the model parameter 6 in Karrer's model. 
In addition, our model differentiates the out-edges from 
in-edges of nodes while Karrer's model does not. Sec- 
ondly, similar to Ren's model, Ball's model [13] focused 
on the community structure while our model can uncover 
multiple types of structural regularities. 



IV. EXPERIMENTAL RESULTS 

In this section, we demonstrate the effectiveness of our 
model at exploring the structural regularities of networks 
by experiments on several real world or artificial net- 
works with various types of intrinsic structural regulari- 
ties. Then we discuss the model selection issue, i.e., how 
to determine the optimal number of groups. 



A. Detecting community structure 

The test network is the famous karate club network 
constructed by Zachary. This network characterizes 
the acquaintance relationship between 34 members of a 
karate club in an American University. A dispute arose 
between the club's administrator and its principal karate 
teacher, and as a result the club eventually split into two 
smaller clubs, centered around the administrator and the 
teacher respectively. The network and its fission are de- 
picted in figure [21 The administrator and the teacher are 
represented by nodes I and 33 respectively. 

By setting the group number c — 2, we fit our model to 
the karate club network data. The resulted matrix w is a 
2x2 identity matrix, indicating that the obtained struc- 
ture is community structure. Fig.[2]shows the two groups 
found by our model with the expectation-maximization 
method. As shown in Fig. [2] the shades of the nodes in 
the figure represent the values of an 1 , where group i is 
the left group. As we can see, our model assigns most of 
the nodes strongly to one group or the other. Actually, 
all but 6 nodes are assigned 100% to one of the groups 
(black and white nodes in the figure). If we simply divide 
the nodes into two disjoint groups by assigning each node 
i to the group r according to the belong coefficients at r , 



1 Since this network is an undirected network, the two kinds of 
belonging coefficient are identical, i.e., oti r = /3i r . 
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TABLE I: Mixed membership of overlapped nodes. 



Node ID 


an 






3 


0.49 


0.00 


0.49 


9 


0.70 


0.96 


0.70 


14 


0.24 


0.00 


0.24 


20 


0.33 


0.13 


0.33 


31 


0.71 


0.92 


0.71 


32 


0.83 


1.00 


0.83 



the resulting groups perfectly correspond to the real split 
of the club. 

In addition, Table Q] gives the belonging coefficient of 
the 6 overlapped nodes which are shared by the two 
groups. These overlapped nodes are nodes 3, 9, 14, 
20, 31, 32. Note that these overlapped nodes are of- 
ten misclassified by traditional partition-based commu- 
nity detection methods. For comparison, we also gives 
the mixed membership of these six nodes according to 
Newman's mixture model and Ren's model. As we can 
see, our model and Ren's model produce the same re- 
sults, which is attributed to the fact that Ren's model is 
a special case of our model. However, Newman's model 
behave very differently from the other two models. Ac- 
tually, for Newman's model, another 10 nodes are also 
assigned to two groups, e.g., nodes 12, 15. Such a result 
is counterintuitive to the real structure of this network. 
As a conclusion, our model performs better than New- 
man's model at detecting the overlaps between groups. 
Ren's model can only detect community structure while 
our model can detect other types of structural regulari- 
ties as illustrated in the following test. 



B. Detecting multipartite structure 

Now we illustrate the detection of multipartite or anti- 
community structure according to our model. The test 
network is the adjacency network of English words taken 
from Ref. [||. In this network, the nodes represent 112 
commonly occurring adjectives and nouns in the novel 
David Copperfield by Charles Dickens, with edges con- 
necting any pair of words that appear adjacent to each 
other at any place in the text. Generally, adjectives oc- 
cur next to nouns in English. Thus most edges in the 
network connect an adjective to a noun and the network 
is approximately bipartite, i.e., this network possesses 
anti-community structure. This can be seen clearly in 
Fig. |31 where the adjectives and nouns are respectively 
represented by circles and squares. 



qi r is defined in [221 as the probability that node i belongs to 
group r. 

is defined in |20t | as the probability that node i belongs 




o 



o 



o°o 00 



□ 



o 
o 



o 



o 



o 



to group r. 



FIG. 3: The adjacency network of English words. Node 
groups corresponding to adjectives and nouns are respectively 
denoted by circle and square. The shades of nodes indicate 
their belonging coefficient obtained by fitting our model to 
this network. 

Fitting our model to this network with c = 2, the re- 
sulted a; is a transposed 2x2 identity matrix, indicating 
that the identified structure is bipartite structure. The 
obtained two groups and node memberships are shown 
by the shades of nodes as shown in Fig. [3] We can see 
that most nodes are assigned to only one group, although 
there are several ambiguous cases corresponding to the 
nodes with intermediate shades. If we assign each node to 
its most preferred group, the resulted two disjoint groups 
well separate the adjectives from the nouns. In fact, 100 
of all the 112 nodes are correctly classified. This accu- 
racy is the same to the result given by Newman's mixture 
model. 

As a comparison, we also apply Ren's model to this 
network by setting the group number being 2. Only 60 
nodes of all the 112 nodes are correctly classified, similar 
to the accuracy of random assignment. The ineffective- 
ness of Ren's model at this network is attributed to that 
Ren's model presumes the existence of community struc- 
ture in the network while the intrinsic structural regular- 
ity is bipartite structure. 



C. Exploring other type of structural regularity 

In the previous tests, we have demonstrate that our 
model can be used to detect both the assortative struc- 
ture (i.e., community structure) and the disassortative 
structure (i.e., multipartite structure) without being told 
that which type of structural regularities exists in the tar- 
get networks. Now we will further show that our model 
can also detect other type of structure which can not be 
revealed by competing models. 

We consider the schematic network depicted in 
Fig. HJa). This network is constructed according to the 
rules in Fig. 0|b). Intuitively, according to the outgoing 
edges in this network, the nodes can be divided into two 
groups: {1,2,3,4} and {5,6,7,8}. Meanwhile, according 
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5-6 


3-4 
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1-4 


Yes 


No 


5-8 


No 


Yes 



(b) 



FIG. 4: A schematic network. The directed edges are placed 
according to the rules described the right table. 

to the incoming edges, the nodes of this network belong 
to another two groups: {1, 2, 5, 6} and {3, 4, 7, 8}. 

We apply Newman's model, Ren's model and our 
model to this schematic network. Limited by the assump- 
tions of models, both Newman's model and Ren's model 
fail to uncover the intrinsic structural regularity accord- 
ing the construction rules. For our model, the flexibility 
of model assumption enables it to accurately detect such 
type of structure. Specifically, by fitting our model to 
this network, the obtained 9 reveal the two groups indi- 
cated by the outgoing edges while the <f) reflect the two 
groups indicated by the incoming edges. 



D. Model selection issue 

In the previous tests, we need to specify the group 
number before fitting our model to network. However, 
the group number is unknown a prior for many cases. 
Thus it is helpful to give a criterion to determinate the 
appropriate group number for given network. This task 
is known as the model selection issue in statistics. We 
deal with this problem by using minimum description 
length principle, which is also used to handle the model 
selection issue in Ren's model. 

According to minimum description length principle, 
the required length to describe the network data is com- 
posed of two parts. The first part describes the cod- 
ing length of the network using our model. This coding 
length is — L for directed network and —L/2 for undi- 
rected network. The second part gives the length for 
coding model parameters. This part is — J2rs ma; rs — 
^ r j(ln6Vi+hi</vi) for directed network and — V, lnuv s — 
^2 ri lnftri for undirected network. In this way, the opti- 
mal c is the one which minimizes the total description 
length. 

As tests, we consider two real world networks with 
prior knowledge of the intrinsic group numbers. These 
two networks are respectively the journal citation net- 
work constructed in Ref. [29[ and the American football 
team network described in Ref. In the journal cita- 
tion network, each node corresponds to a journal and all 
the 40 journals are from four different fields: multidisci- 
plinary physics, chemistry, biology and ecology. Journals 
from the same field are more likely connected by citation 
relation. For the football network, nodes represent the 
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FIG. 5: Model selection results for (a) the journal citation 
network and (b) American football team network. 



115 teams respectively belonging to 12 conference and 
generally games are more frequent between members of 
the same conference than between teams of different con- 
ferences. 

As shown in Fig. [5j the number of intrinsic groups 
is correctly identified for the journal citation network. 
However, for the football network, 11 is the optimal num- 
ber of groups while the intrinsic number is 12. By check- 
ing the found node groups, we find that only 11 node 
groups have their identities, i.e., each group contains at 
least one node after assigning nodes to their most pre- 
ferred groups according to the obtained belonging coeffi- 
cient a or p. This indicates that the appropriate group 
number is 11 for the football network. In fact, many 
well-known community detection methods also identify 
11 communities. 



V. CONCLUSIONS 

In this paper, we have studied the exploration of in- 
trinsic structural regularities in network using a general 
stochastic blockmodel. Without prior knowledge, our 
model can not only detect broad types of intrinsic struc- 
tural regularities, but also can learn the type of identi- 
fied structure directly from the network data. Tests on a 
number of artificial and real world networks demonstrate 
that our model outperforms the state-of-the-art models 
at shedding light on the structural features of networks. 
The flexibility enables our model be an effective way to 
reveal the structural regularities of network and further 
to help us understand the relationship between structure 
and function of network. For potential application, our 
model can be used to predict the emergence or vanishing 
of edges in network. As future work, we will generalize 
our model by releasing the requirement that the block 
matrix is a square matrix and investigate the possible 
applications of the more flexible model. 
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